Proceedings of FDIA 2007 BCS IRSG Symposium

نویسندگان

  • Andrew MacFarlane
  • Leif Azzopardi
  • Iadh Ounis
  • Ulises Cervino Beresi
  • Alan Woodley
چکیده

This paper presents an automatic genre classification model that implements a flexible classification scheme, i.e. a scheme capable of performing zero-, oneor multi-genre assignment. I suggest that this scheme is more appropriate for genres on the web, because many web pages have often more than one genre or none at all. The model that I propose relies on the distinction between the concepts of ‘text types’ and ‘genre’, which are both ‘inferred’ and not ‘learned’ from pre-labelled examples. The main drawback of this approach is that it cannot be fully evaluated given the limitations of current genre research. However, I present a partial evaluation that shows that the model performs competitively, and remains stable when re-scaled.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In This Issue Editorial 1 Product Review 2 " Corpora Software "

First, membership. If you are new to the IRSG, welcome you are in good company. Membership numbers have more than trebled over the past year, mainly through reaching out to audiences that the IRSG hadn't traditionally focussed on namely, the thousands of people in the BCS and further afield who have a professional (or indeed personal) interest in information search/retrieval, but aren't necessa...

متن کامل

ECIR Draft Paper Guidelines BCS Information Retrieval Specialist group 8 th November 2006

Preface On the 20 th of June, 2006 a workshop on the European Conference in Information Retrieval was held which aimed to develop a set of guidelines for authors and reviewers of ECIR papers. These draft guidelines have been complied based upon the presentations from the workshop, and have been extracted from the full workshop report, which is available to download from the BCS-IRSG website (ht...

متن کامل

Modeling Information Retrieval with Probabilistic Argumentation Systems

Probabilistic Argumentation Systems (PAS) are a technique for representing uncertainty both symbolically and numerically. It is shown that this technique, which combines symbolic logic and probability, can be used as a general model of information retrieval. PAS provide a dual (symbolic and numerical) interpretation of the logical uncertainty principle, and are a flexible model for integrating ...

متن کامل

Using Combination of Evidence for Term Expansion

Expanding a user query automatically with terms taken from documents that are most similar to the query is a reliable way of nding more relevant documents. To date most approaches to this problem have focused on modifying the query. In this paper we argue that it is useful to create a new query from similar documents, rank both the user query and the new query, and combine the evidence. We show...

متن کامل

Advances in Rule Interchange and Applications, International Symposium, RuleML 2007, Orlando, Florida, October 25-26, 2007, Proceedings

Title Type advances in rule interchange and applications international symposium ruleml 2007 orlando florida PDF rule based reasoning programming and applications 5th international symposium ruleml 2011 europ PDF advances in computation and intelligence second international symposium isica 2007 wuhan china s PDF stochastic algorithms foundations and applications 4th international symposium saga...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007